Integrating rule and template-based approaches for emotional Malay speech synthesis
نویسندگان
چکیده
The manipulation of prosody, including pitch, duration and intensity, is one of the leading approaches in synthesizing emotion. This paper reports work on the development of a Malay Emotional synthesizer capable of expressing four basic emotions, namely happiness, anger, sadness and fear for any form of text input with various intonation patterns using the prosody manipulation principle. The synthesizer makes use of prosody templates and prosody parametric manipulation for different types of sentence structure.
منابع مشابه
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملAdding Emotions to Malay Synthesized Speech Using Diphone-based Templates
This paper concerns the addition of an affective component to Fasih, one of the first Malay Textto-Speech systems developed by MIMOS Berhad. The goal is to introduce a new method of incorporating emotions to Fasih by building an emotions filter that is template-driven. The templates are diphone-based emotional templates that can portray four types of emotions, i.e. anger, sadness, happiness and...
متن کاملProsodic Analysis and Modelling for Malay Emotional Speech Synthesis
This paper discusses an emotional prosody generator for a Malay speech synthesis system that can re-synthesize the selected vocal emotion from neutral synthesized speech output and improve the naturalness by adopting rulebased prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency and duration, has been widely investigated in sev...
متن کاملTemplate-driven Emotions Generation in Malay Text-to-Speech: A Preliminary Experiment
This paper describes the pilot experiment conducted for the purpose of adding an affective component to the first Malay Text-to-Speech (TTS) system, Fasih. The aim is to test a new method of generating an expressive speech via a template-driven system based on diphones as the basic sound. The synthesized expressive speech can express four types of emotion. However, as an initial test the pilot ...
متن کاملEM-HTS: real-time HMM-based Malay emotional speech synthesis
This research aims at developing a real-time HMM-based Malay emotional speech synthesis (EM-HTS) that has the ability to synthesize any form of text input in four different expression which are neutral, anger, sadness and happiness. The quality of the emotional speech synthesis was improved by using Neutral to Angry, Sad, and Happy (NASH) duration generator, which uses context-dependent duratio...
متن کامل